A neural mechanism for binaural pitch perception via ghost stochastic resonance 
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We present a physiologically plausible binaural mechanism for the perception of the pitch of 
complex sounds via ghost stochastic resonance. In this scheme, two neurons are driven by noise and 
a different periodic signal each (with frequencies /i = kfo and fa = (fc+l)/o, where k > 1), and their 
outputs (plus noise) are applied synaptically to a third neuron. Our numerical results, using the 
Morris-Lecar neuron model with chemical synapses explicitly considered, show that intermediate 
noise levels enhance the response of the third neuron at frequencies close to fo, as in the cases 
previously described of ghost resonance. For the case of a inharmonic combination of inputs (/i = 
kfo + A f and / 2 = (fe + l)/o + A/) noise is also seen to enhance the rates of most probable spiking 
for the third neuron at a frequency f r = fo + fc ^jy 2 • ^ n addition, we show that similar resonances 
can be observed as a function of the synaptic time constant. The suggested ghost-resonance-based 
stochastic mechanism can thus arise either at the peripheral level or at a higher level of neural 
processing in the perception of pitch. 



The perception and processing of environmental 
complex signals resulting from the combination 
of multiple inputs is a nontrivial task for the ner- 
vous system. In many species, solving efficiently 
this sensory problem could have an evolutionary 
payoff. A classical example is the perception of 
the pitch of complex sounds by the auditory sys- 
tem, the mechanism of which remains controver- 
sial. Recently, a mechanism for the perception 
of pitch has been proposed on the basis of the 
so-called ghost stochastic resonance. Under this 
paradigm, an appropriate level of noise yields an 
optimal subharmonic neural response to a combi- 
nation of two or more harmonic signals that lack 
the fundamental frequency, which is nevertheless 
perceived by the system. The original proposal 
concentrated in the peripheral level of the per- 
ception process, by considering the case of a sim- 
ple monoaural presentation of the complex sig- 
nal. On the other hand, it is known that com- 
plex sounds are also perceived when its two con- 
stituent tones are presented binaurally (i.e. one 
in each ear). Thus, the question that remains is 
whether ghost stochastic resonance can partici- 
pate in detecting this "virtual" dichotic pitch at 
a higher level of processing. In this paper we 
present, on the basis of numerical simulations, a 
plausible mechanism for the binaural perception 
of the pitch of complex signals via ghost stochas- 
tic resonance. In this scenario, each of the two 
input tones drives a different noisy neuron (cor- 
responding to detection in the left/right auditory 
pathways), and together they drive a third noisy 
neuron that perceives the missing fundamental. 



'Electronic address: pablo.balcnzucla@upc.es also at Fundacion 
Antorchas, Chile 300 (1098), Buenos Aires, Argentina. 
^Electronic address: jordi.g.ojalvo@upc.es 



In this way, the same basic mechanism of ghost 
resonance can explain pitch perception occurring 
at both the peripheral and a higher processing 
level. 



I. INTRODUCTION 

A. Pitch perception by single neurons 

Under many conditions sensory neurons can be consid- 
ered as noisy threshold detectors, responding to external 
signals (either from the environment or from other neu- 
rons) in an all-or-none manner. Substantial effort has 
been dedicated to examine theoretically and numerically 
the response of neurons to simple input signals, usually 
harmonic, both under deterministic Q, Q and stochastic 
conditions. 

Much less studied is the case of multiple input signals. 
It is known, for instance, that a high frequency signal 
enhances the response of a neuron to a lower frequency 
driving via vibrational resonance ,5| ■ On the other hand, 
two-frequency signals are commonly used for diagnostic 
purposes, such as in the analysis of evoked potentials in 
the human visual cortex |(|, but the detection and pro- 
cessing of this type of combined signals is poorly under- 
stood. Recently, a study of the response of a neuron to 
a combination of harmonics in which the fundamental is 
missing has shed new light upon the problem of the 
perception of the pitch of complex sounds Q . 

The perceived pitch of a pure tone is simply its fre- 
quency. In contrast, the perceived pitch of a complex 
sound (formed by a combination of pure tones) is a sub- 
jective attribute, which can nevertheless be quantified 
accurately by comparing it with a pure tone. In the par- 
ticular case of harmonic complex sounds (signals whose 
constituent frequencies are multiple integers of a funda- 
mental frequency), the perceived pitch is the fundamen- 
tal, even if that frequency is not spectrally present in the 
signal. For that reason, the pitch is usually referred to 
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in this case as a "virtual pitch", and its perception is 
sometimes called the "missing fundamental illusion" . 

The neural mechanism underlying pitch perception re- 
mains controversial. From a neurophysiological perspec- 
tive, the perceived pitch is associated with the inter- 
spike interval statistics of the neuronal firings 0, fiof . 
The analysis presented in Refs. 0, @ shows that a neu- 
ron responds optimally to the missing fundamental of 
a harmonic complex signal for an intermediate level of 
noise, making use of two ingredients: (i) a linear inter- 
ference of the individual tones, which naturally leads to 
signal peaks at the fundamental frequency, and (ii) a 
nonlinear threshold that detects those peaks (with the 
help of a suitable amount of noise, provided the signal 
is detcrministically subthreshold). The behavior of this 
relatively simple model yields remarkably good agree- 
ment with previous psychophysical experiments 0] ■ The 
phenomenon has been termed ghost stochastic resonance 
(GSR), and has been replicated experimentally in ex- 
citable electronic circuits 0] and lasers |l3j. 



B. Signal integration and processing of distributed 
inputs 

Besides the question of how pitch is perceived, an- 
other contested debate relates to where perception takes 
place. Although interval statistics of the neuronal fir- 
ings 0, show that pitch information exists in pe- 
ripheral neurons, other results seem to indicate that, at 
least to some extent, pitch perception takes place at a 
higher level of neuronal processing |14| . A typical exam- 
ple is found in binaural experiments, in which two com- 
ponents of a harmonic complex signal enter through dif- 
ferent ears. It is known that in that case a (rather weak) 
low-frequency pitch is perceived. This is called "dichotic 
pitch", and can also arise from the binaural interaction 
between broad-band noises. For example, Cramer and 
Huggins 0] studied the effect of a dichotic white noise 
when applying a progressive phase shift across a narrow- 
band of frequencies, centered on 600 Hz, to only one of 
the channels. With monaural presentation listeners only 
perceived noise, whereas when using binaural presenta- 
tion over headphones, listeners perceived a 600-Hz tone 
against a background noise. 

It is worth examining whether the ghost resonance 
mechanism introduced by Chialvo et al. 0, @ can also 
account for the binaural effects described above. Ghost 
resonance has already been seen to be enhanced by cou- 
pling in experiments with diffusively coupled excitable 
lasers ^|| , but no studies in synaptically coupled neurons 
have been made so far. Given that chemical synapses 
lead to pulse coupling, a reliable coincidence detection 
is required in order for ghost resonance to arise in this 
case. In what follows we examine the situation in which 
two different neurons receive one single component of the 
complex signal each (so that each neuron represents de- 
tection at a different auditory channel in a binaural pre- 



sentation), and act upon a third neuron which is expected 
to perceive the pitch of the combined signal. Our results 
show that this higher-level neuron is indeed able to per- 
ceive the pitch, hence providing a neural mechanism for 
the binaural experiments. 



II. MODEL DESCRIPTION 

A. Neuron Model 

We describe the dynamical behavior of the neurons 
with the Morris-Lecar model [l7|. 
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where Vi and Wi stand for the membrane potential and 
the fraction of open potasium channels, respectively, and 
the subindex i labels the different neurons, with i = 1 , 2 
representing the two input neurons and i = 3 denoting 
the processing neuron. C m is the membrane capacitance 
per unit area, is the external applied current, J? yn 
is the synaptic current, and the ionic current is given by 
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where g a (a = Ca, K, L) are the conductances and V® 
the resting potentials of the calcium, potassium and leak- 
ing channels, respectively. The following functions of the 
membrane potential are also defined: 
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where Vmi, Vm2, Vwi an d Vw2 are constants to be spec- 
ified later. The last term in Eq. is a white noise term 
of zero mean and amplitude Di, uncorrelated between 
different neurons. 

In the deterministic and single-neuron case, this sys- 
tem shows a bifurcation to a limit cycle for increasing ap- 
plied current 7 app 0] . This bifurcation can be a saddle- 
node (type I) or a subcritical Hopf bifurcation (type II) 
depending on the parameters. We chose this last option 
for the numerical calculations presented in this paper. 
The specific values of the parameters used in what fol- 
lows are shown in table |QJ 0|. The equations were in- 
tegrated using the Heun method [2l| , which is equivalent 
to a second order Runge-Kutta algorithm for stochastic 
equations. 
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B. Synapsis model 

In this work we couple the neurons using a simple 
model of chemical synapsis [2(J. In this model, the synap- 
tic current through neuron i is given by 



r s y n _ 
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(7) 



where the sum runs over the neighbors that feed neuron i, 
g| yn is the conductance of the synaptic channel, rj stands 
for the fraction of bound receptors of the postsynaptic 
channel, V% is the postsynaptic membrane potential, and 
E s is a parameter whose value determines the type of 
synapsis (if larger than the rest potential, e.g. E s = 
mV, the synapsis is excitatory; if smaller, e.g. E s — 
—80 mV, it is inhibitory). 

The fraction of bound receptors, r i; follows the equa- 
tion 



(8) 



where [T]i — 6(TQ+T syn — t)6(t — TQ) is the concentration 
of neurotransmitter released in the synaptic cleft, a and 
are rise and decay time constants, respectively, and Tq is 
the time at which the presynaptic neuron (labeled now i) 
fires, what happens whenever the presynaptic membrane 
potential exceeds a predetermined value, in our case cho- 
sen to be 10 mV. The time during which the synaptic 
connection is active is given by r syn . The values of the 
parameters that we use, specified below, were taken from 
|2Cj and could be read from table |Q}. 



III. THE CASE OF DISTRIBUTED HARMONIC 
COMPLEX SIGNALS 

As mentioned above, we consider a network of three 
neurons organized in two layers. The first layer is com- 
posed of two units (called "input neurons" ) that receive 
the external inputs, and whose responses act upon the 
processing layer, composed in this case of only one unit 
(called "processing neuron") . The coupling is unidirec- 
tional from each of the input neurons to the processing 
neuron. Of course, physiological realism dictates that 
more than three neurons will be present. However, we 
model here for simplicity the simplest possible case; one 
can expect that adding more neurons will only improve 
the results. 

In order to analyze the global response of this network 
to a distributed complex signal, we apply to each one of 
the input neurons a periodic external current with fre- 
quencies /i and ji. In response these neurons emit a 
sequence of spikes with inter-spike interval distributions 
centered at f\ and fa and with variances directly related 
to the noise intensities D\ and D 2 . 

Figure shows the behavior of the system in the ab- 
sence of noise (D\ = D2 = D3 = 0). In this deterministic 
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Morris-Lecar: Til 


Cm 


5 fj,F / cm 


9K 


8 /iS/cm 2 


9L 


2^S/cm 2 


QCa 


4.4^S/cm 2 


V K 


-80 mV 


V l 


-60 mV 


Vca 


120 mV 


Vmi 


-1.2 mV 


Vm2 


18 mV 


Vwi 


2mV 
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30 mV 


4> 


1/25 s" 1 


Parameters 


Synapsis 


a 




P 


0.1ms" 1 


Qsyn 


(specified in each case) 


T~syn 


(specified in each case) 


E s 


OmV 



TABLE I: Parameters values of the Morris- 
Lecar and synapse models used in this work. 



situation, the input neurons fire exactly with the frequen- 
cies at which they are modulated. If /1 = kfo (or, equiv- 
alently, T\ = To/k, where Tj is the period corresponding 
to the frequency /<) and f 2 = (k + l)f (T 2 = T /(k + l)), 
the two input neurons exhibit simultaneous spikes at in- 
tervals T = 1/ fo (provided the two harmonic signals are 
in phase), so that the synaptic current acting on the third 
neuron has maxima with the same frequency, as can be 
observed in Fig. ^c). In this example T\ = 100 ms and 
T 2 = 150 ms, so that k = 2 and T = 300 ms. Un- 
der these conditions, and for an adequate value of the 
coupling strength g syn , the processing neuron fires with 
frequency fo — 1/300 kHz, as shown in Fig. IHd). Hence, 
in this case a deterministic ghost resonance is observed. 

The previous example, however, is not realistic, since 
in normal conditions a neuron is affected by a substan- 
tial level of noise coming from, among other sources, the 
background activity of other neurons acting upon it. This 
causes a drift in the spike times and a broadening in the 
distribution of inter-spike intervals. We will now show 
that even in this case the missing fundamental frequency 
can be successfully detected, as was suggested in 0,|8| for 
a single neuron, even if the synaptic coupling g syn and the 
applied current in the output neuron, /3 PP , are slightly 
below the bifurcation threshold, so that the neuron does 
not fire in absence of noise (D3 = 0). 

With this in mind, we conduct a series of numerical ex- 
periments looking for the occurrence of ghost stochastic 
resonance. We choose /1 = 2 Hz and = 3 Hz, so the 
ghost resonance should be located at fo = 1 Hz. As is 
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FIG. 1: Deterministic response to a distributed harmonic 
complex signal. The membrane potential for the three neu- 
rons is shown: (a,b) input neurons, (d) processing neuron. 
The synaptic current acting on neuron 3, Il yn , is shown in 
plot (c). The two inputs neurons are fed with two sinusoidal 
signals of periods Ti = 150 ms and T2 = 100 ms respec- 
tively (which gives a ghost resonance of To = 300 ms). The 
synaptic coupling between input and processing neurons is 
g 3yn = 2 nS and T 3yn = 1 ms. All noise intensities are zero 
Di = D 2 = D z = 0. 



FIG. 2: Left panels: response of the processing neuron for 
increasing noise intensity: (a) mean time between spikes (Tp), 

(b) normalized variance of the distribution R p = a p / (T p ), and 

(c) fraction of pulses spaced around To = l//o, Ti = l//i 
and T2 = I//2 as a function of the noise amplitude in the 
processing neuron, D3. Right panels: probability distribution 
functions of the time between spikes T p for three values of the 
noise intensity D3: (d) D3 = 1, (e) D3 = 4, and (f) TJ3 = 8. 
Parameters are r syn = 35 ms and g ayn = 1 nS for the synapsis 
and we used fi=2 Hz and / 2 = 3 Hz (which gives /o = 1 Hz) 
for the sinusoidal signals in the input neurons. 



usual in neurophysiology, in order to quantify the behav- 
ior of the system, we follow the time between consecutive 
spikes, T p . In what follows, we analyze the first two mo- 
ments of the distribution of T p , its mean value (T p ) and 
its normalized variance R p = a p /{T p ). To estimate the 
coherence of the output with the frequencies of interest, 
we also compute the fraction f t o of inter-spike intervals 
in the neighborhood of T = l//o- The dependence of 
these variables (corresponding to the processing neuron) 
on the noise intensity -D3 is shown in Fig. [2J These 
results display a clear resonance at D3 ~ 4. The nor- 
malized variance of the inter-spike interval distribution 
[Fig. Hlb)] exhibits a minimum when the spikes of the 
third neuron are spaced, on average, (T p ) = 1000 ms [Fig. 
[2J a)]. Additionally, around 80% of the spikes are spaced 
±5% around T = 1000 ms for D 3 ~ 4 [Fig. EJc)]. These 
results clearly indicate that noise enhances the response 
of the processing neuron at the frequency /o, which is 
not present in the input neurons. 

The right panels of Fig. [21 show the probability distri- 
bution functions of the inter-spike intervals T p for three 
values of the noise in the processing neuron. For low 
noise intensities, the neuron spikes most likely when two 
input spikes arrive together, but with randomly one or 
more of these coincidence events is skipped. For this rea- 
son, the probability distribution function shows peaks 
centered at multiples of To, as it usually happens in con- 
ventional stochastic resonance j2^|. As the noise level 
increases skips occurs less frequently, until an optimal 
noise for which almost all spikes occur every To, i.e. at 
the missing fundamental frequency. For even larger noise 
intensities, spikes appear at the original input frequencies 



fi = l/Ti and f 2 = 1/T 2 , as can be observed in Fig. [2(f). 

In the binaural mechanism of ghost stochastic reso- 
nance described above, synaptic coupling obviously plays 
an important role, since the transfer of the input modu- 
lation from the sensory neurons to the processing neuron 
occurs synaptically. Taking into account that synaptic 
transmission is an intrinsically dynamical phenomenon 
(whose temporal behavior we are modelling explicitly), 
it is natural to expect that the characteristic time scale 
of this process will influence the occurrence of the reso- 
nance. Indeed, the results shown above correspond to an 
optimal value of the synaptic time r syn . As shown in Fig. 
[51 for fixed noise level -D3, a resonance in the response of 
the system to the missing fundamental is also observed 
with respect to r syn . 

We recall that r syn represents the time during which 
the neurotransmitters remain in the synaptic cleft before 
they start to disappear with rate /3, and it is a measure 
of the width of the pulses of the synaptic current re- 
ceived by the processing neuron. Therefore, for low T syn 
[Fig- Eld)] the synaptic pulses are very narrow, and hence 
coincidence detection is compromised. The characteris- 
tic probability distribution function in this case presents 
peaks at multiples of To, indicating that even if the noise 
level is optimized, coincident spikes from input neurons 
are skipped. 

As T syn increases the current pulses widen and coinci- 
dence detection improves, so that an optimal situation is 
reached for which the ghost resonance is very clear. But 
if we continue increasing the value of r syn the synaptic 
pulses become exceedingly wide and sequences of double 
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FIG. 3: Left panels: (a) mean time between spikes, (b) nor- 
malized variance of the inter-spike interval distribution, and 

(c) fraction of pulses around To = l//o and Td s = l//ds as a 
function of r syn . Right panels: Probability distribution func- 
tions of the inter-spike intervals T v for three values of T syn : 

(d) T syn = 5 ms, (e) r syn = 35 ms, and (f) T Byn = 150 ms. 
The value of g 3yn is different for each value of T syn ranging 



from g s 

for Tsyn 



2.50 nS for tv 



1.5 ms up to g ay 



1.00 nS 



~syn = 150 ms. fi = 2 Hz and fa = 3 Hz (which gives 
/o = 1 Hz) for the sinusoidal signals in the input neurons was 
used. 



spikes appear (spaced by Tds = l//ds)- This happens 
because noise can excite two spikes while the synaptic 
current remains high. Indeed, Fig. |3Jc) shows that the 
fraction of spikes occurring at intervals around Tds (±5%) 
begins to be important for r syn > 50 ms. The corre- 
sponding distribution function in Fig. [3ff)j shown here 
for T syn = 150 ms, corroborates this fact. 

The joint effect of the synaptic time r syn and the noise 
level D3 can be observed in the three-dimensional plots 
shown in Fig. QJ This figure shows R p and f t o as a 
function of both D3 and r syn . We can see the response 
of the processing neuron at the missing fundamental is 
most favorable when both parameters are simultaneously 
optimized. The normalized variance of the intcr-spikc 
interval distribution, R pi shows a clear minimum for 
T S yn ~ 35 ms and D ~ 4. For these same parameter 
values, the fraction of spikes fto occurring at intervals 
around Tq exhibits a maximum at almost 80%. 



/1 = kfo + A/, / 2 = (fc + l)/ + A/. 



(9) 



with k integer. In other words, fo is no longer the greatest 
common divider of /1 and fi, even though it's still their 
difference. If the system is simply detecting the difference 
/2 — fi, it should always display a fixed resonance at 
fo, independently of the frequency shift A/. But if the 
pitch detection does depends on A/, it will no longer be 
perceived as the difference between the input frequencies. 
This last situation is in fact what was found in human 
experiments |l it . The neural mechanism proposed in 0, 
shows that the frequency of the ghost resonance shifts 
linearly with A/ following the relation, 



fr = f + 



A/ 



k + 1/2 



(10) 



in agreement with the auditory experimental results of 
Refs. H|lS|l3. 

We now examine whether a scaling similar to that of 
Eq. H10|) is observed in the response of the processing 
neuron. We fix the noise intensity -D3 and synaptic time 
T syn to their optimal values at the resonance (D3 = 4, 
T S yn = 35ms) and compute the probability of observing a 
spike with rate f r for increasing A/. The results are plot- 
ted (in gray scale) in Fig. [5]as a function of /1, and show 
that the largest probability corresponds to spike rates fol- 
lowing the prediction of relation relation l|10|l . Changing 
the noise intensity only obscures the observation of the 
spike density, but it does not affect the agreement with 
the theoretical expression. In the bottom of Fig. [5] one 
can also see traces of less probable spikes, corresponding 
to a trivial subharmonic response of the system. 

Figure[S]shows that the processing neuron emits spikes 
following Eq. (|10fl for k = 2, 3, 4, 5. As mentioned above, 
this relation is sustained by experimental data of pitch 
detection [JjJ. Those experimental results indicate that 
equidistant tones in monoaural presentation do not pro- 
duce constant pitch, similarly to what we observe in our 
binaural numerical experiments. We are not aware of 
binaural human experiments shifting the frequency com- 
ponents as in |ll| , which would be interesting to compare 
with our numerical predictions in Eq. (|10|) . 



V. CONCLUSIONS 



IV. THE INHARMONIC CASE 

A paradigmatic experimental result in pitch perception 
refers to the pitch reported by human subjects to the 
presentation of a inharmonic complex sound, in which 
the originally harmonic components of the input are all 
shifted in frequency by a constant A/. In such a way the 
individual component are still separated in frequency by 
a constant missing "fundamental" fo, but are no longer 
multiples of it. The frequencies /1 and fi are chosen to 



In this paper we demonstrate the phenomenon of ghost 
stochastic resonance in a neural circuit where two neu- 
rons receive two components of a complex signal and 
their outputs drive a third neuron that processes the in- 
formation. The results show that the processing neu- 
ron responds preferentially at the "missing fundamen- 
tal" frequency, and that this response is optimized by 
synaptic noise and by synaptic time constant. The pro- 
cessing neuron is able to detect the coincident arrival of 
spikes from each of the input neurons, and this coinci- 
dence detection is analogous to the linear interference of 



FIG. 4: Left: fraction of pulses fto occurring at intervals T v equal (±5%) to the period of the ghost resonance (To = l//o). 
Right: normalized variance of the inter-spike interval distribution, R p . Both quantities plotted as function of noise intensity 
(D 3 ) and r syn . 
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FIG. 5: Probability of observing a spike in the processing 
neuron with instantaneous rate f r (in gray scale) as a function 
of the frequency f\ of one of the input neurons. We can 
observe a remarkable agreement of the responses following the 
lines predicted by Eq. il l (H for k — 2, 3, 4, 5 (dashed lines from 
top to bottom). Parameters: T syn = 35 ms, g syn = 1.0 nS, 
D 3 = 4.0, fi = 2Hz + A/, fa = 3Hz + A/. 



harmonic components responsible of the ghost response 



in the single-neuron case • A brain structure candidate 
for this dynamics is the inferior colliculus, which receives 
multiple inputs from a host of more peripheral auditory 
nuclei. Details of the physiology of this nucleus are still 
uncertain, but enough evidence suggests that temporal 
and frequency representations of the inputs are present 
in the spike timing of their neurons. Our results suggest 
that the neurons in this nucleus can exhibit the dynam- 
ics described here, thus participating in the perception 
of binaural pitch. The main consequence of these obser- 
vations is that pitch information can be extracted mono 
or binaurally via the same basic principle, i.e. ghost 
stochastic resonance, operating either at the periphery 
or at higher sensory levels. 
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